
fix(datasets): increase create version request timeout #389

Merged
jared-paperspace merged 2 commits into master from jlunde/pla-1127-dataset-upload-cli-command-not-working
Jun 22, 2022

Conversation

@jared-paperspace (Contributor) commented Jun 22, 2022

After some digging, it turns out that larger files were timing out due to the default timeout of 5 seconds. Since the errors happen inside the worker pool, they are never reported to the user.

  • Increases the default timeout to 5 minutes per 15MB file
  • Better use of the requests.Session() context so that connection pooling is used and connections to a given host are maintained between requests. This should yield slightly better performance for parallel uploads.
  • Report progress to the user more often (every chunk). There is still much room for improvement here :)
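The "5 minutes per 15MB" rule could be expressed as a small helper. This is a sketch with assumed names (`PART_MINSIZE`, `upload_timeout`), not the PR's actual code:

```python
import math

PART_MINSIZE = int(15e6)   # 15 MB, the multipart part size
TIMEOUT_PER_PART = 300     # 5 minutes per 15 MB, in seconds

def upload_timeout(size_bytes):
    # Scale the request timeout with file size: one 5-minute
    # budget for every 15 MB (or fraction thereof).
    parts = max(1, math.ceil(size_bytes / PART_MINSIZE))
    return parts * TIMEOUT_PER_PART
```

A 75 MB file would then get a 25-minute budget instead of the old flat 5 seconds.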

Screenshots

Screen Shot 2022-06-21 at 8 28 52 PM

Screen.Recording.2022-06-21.at.8.25.35.PM.mov

@jared-paperspace jared-paperspace self-assigned this Jun 22, 2022
```python
    headers.update({'Content-Size': '0'})
    r = session.put(url, data='', headers=headers, timeout=5)
# for files under 15MB
elif size <= (15e6):
```
jared-paperspace (author):
to 15MB from 500MB

```python
elif size <= (15e6):
    with open(path, 'rb') as f:
        r = session.put(
            url, data=f, headers=headers, timeout=300)
```
jared-paperspace (author):
Increase timeout from 5 seconds to 5 minutes

```python
    presigned_url,
    data=chunk,
    headers=headers,
    timeout=300)
```
jared-paperspace (author):
Increase timeout from 5 seconds to 5 minutes

Review comment:

maybe this timeout can be pulled out too... 🤷

```python
part_res = session.put(
    presigned_url,
    data=chunk,
    headers=headers,
```
jared-paperspace (author):
Add headers

```python
# console! Which again, jank and noisy, but arguably
# better than a task sitting forever, never either
# completing or emitting an error message.
print(
```
jared-paperspace (author):
Report every chunk

Review comment:

This isn't too noisy now that we removed the branch around it?

jared-paperspace (author):
No, I don't think the previous one was noisy enough honestly. It feels like nothing is happening. In general, we just need a better progress bar.
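A per-chunk progress line could look like the sketch below (the function name and output format are assumptions, not the PR's code):

```python
def report_progress(uploaded, total, label):
    # Print a running byte count after every uploaded chunk so
    # long uploads visibly make progress instead of appearing hung.
    pct = 100 * uploaded / total
    print(f"{label}: {uploaded}/{total} bytes ({pct:.0f}%)")
```

A real progress bar (e.g. carriage-return updates or a TTY-aware library) would be the longer-term fix the author alludes to.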

```python
    content_type=result['mimetype'],
    dataset_version_id=dataset_version_id,
    key=result['key'])
with requests.Session() as session:
```
jared-paperspace (author) commented Jun 22, 2022:
Use connection pooling from urllib3. Previously we weren't utilizing this feature, only the context.
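As a sketch of what reusing one `requests.Session` buys: urllib3 keeps a per-host connection pool behind the session, so repeated `put` calls reuse sockets instead of reconnecting. The explicit `HTTPAdapter` pool sizes below are illustrative, not from the PR:

```python
import requests
from requests.adapters import HTTPAdapter

def make_session(pool_maxsize=8):
    # One Session shared across uploads; urllib3's connection pool
    # keeps sockets to the storage host alive between requests.
    session = requests.Session()
    session.mount('https://', HTTPAdapter(pool_connections=4,
                                          pool_maxsize=pool_maxsize))
    return session

# usage (no network performed here):
# with make_session() as session:
#     r = session.put(url, data=f, headers=headers, timeout=300)
```

Even without mounting a custom adapter, `requests.Session` pools connections by default; the adapter only tunes pool sizes for parallel workers.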

```python
# We can dynamically assign a larger part size if needed,
# but for the majority of use cases we should be fine
# as-is
part_minsize = int(15e6)
```
Review comment:

might be worth moving this byte size to a 'constant'. i see it referenced above too.

jared-paperspace (author):

yep, done
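Pulling the repeated literals up to module level would address both this and the earlier "maybe this timeout can be pulled out too" comment. A sketch with assumed constant names (the PR does not show the final spelling):

```python
# Assumed names, for illustration only.
PART_MINSIZE = int(15e6)   # 15 MB single-request / part-size threshold
PUT_TIMEOUT = 300          # 5 minutes, passed to every session.put()

def needs_multipart(size_bytes):
    # Files above the threshold go through the multipart path.
    return size_bytes > PART_MINSIZE
```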

@brodeyn left a comment:

looks good just a few nits

@jared-paperspace jared-paperspace merged commit c486182 into master Jun 22, 2022
@jared-paperspace jared-paperspace deleted the jlunde/pla-1127-dataset-upload-cli-command-not-working branch June 22, 2022 14:25

PSBOT commented Jun 22, 2022

🎉 This PR is included in version 2.0.5 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

@PSBOT PSBOT added the released label Jun 22, 2022
Comment on lines +619 to +623
```python
# we +2 the number of parts since we're doing floor
# division, which will cut off any trailing part
# less than the part_minsize, AND we want to 1-index
# our range to match what AWS expects for part
# numbers
```
@ghost commented Aug 13, 2022:

Is this true? Shouldn't you use ceil? I have a `gradient version create` and `gradient files put` where it crashes when there is a file that evenly divides the 15MB chunk size (75MB). I suspect that it's trying to read an extra chunk that doesn't exist. The progress bar says 90MB/75MB when it crashes. Does that make sense? I could be misreading things.

Also, I think you mean to say that you add 1 because part numbers start counting from 1 and you need to correct for `range`'s exclusive upper bound. Not entirely sure. Could you check this out?
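The failure mode described above checks out arithmetically. This sketch contrasts the floor-division-plus-two count with a `math.ceil` count for a file that divides evenly into 15 MB parts (function names are mine, for illustration):

```python
import math

PART_MINSIZE = int(15e6)  # 15 MB

def parts_floor_plus_two(size):
    # The approach the code comment describes: floor division,
    # then +2 to cover a trailing partial part and 1-indexing.
    return size // PART_MINSIZE + 2

def parts_ceil(size):
    # ceil never over-counts, even when size divides evenly.
    return math.ceil(size / PART_MINSIZE)

size = int(75e6)                    # exactly five 15 MB parts
upper = parts_floor_plus_two(size)  # 7 -> range(1, 7) reads 6 parts
```

Six 15 MB reads is 90 MB attempted against a 75 MB file, which matches the 90MB/75MB progress-bar report.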

@ghost commented Aug 13, 2022:

Looks like this was a known bug in a previous PR but intentionally left in. 🥇 The effect of reading past the end of the file was not predicted though.

#384 (comment)

@ghost ghost mentioned this pull request Aug 15, 2022
3 participants